This presentation outlined the concepts that are fundamental to the new 
Common Data Format. With PCDS Version 4.0, the Common Data Format (CDF) 
will supersede the Climate Data File (also CDF) of earlier versions. This 
new format incorporates generalizations in both design and terminology that 
make it applicable to multidisciplinary data sets. Furthermore, the new CDF 
will be made available to programmers as a software package that shields 
them from the low-level details of file formats. 

The CDF interface routines create an abstract conceptual environment for the 
scientific programmer. The principal concept for "visualizing" a CDF is 
known as the "basic grid." A basic grid is an n-dimensional block by means 
of which a CDF is constructed. The size of the block may vary from one CDF 
to another, but is constant within any individual CDF. Thus, the basic grid 
serves as a fundamental uniform building block for a CDF. The number of 
grid dimensions and the size of each dimension are chosen by the 
scientist/programmer to represent the patterns by which data are structured. For 
example, each cyclic independent variable would typically give rise to a 
basic grid dimension with size equal to the number of distinct values in the 
variable’s cycle. 

The uniform grid structure appears to the programmer to be propagated into 
each record for each variable. The CDF stores a variable value for each 
lattice point of the grid for each record. These data values are inserted 
and retrieved simply by specifying the variable's identifier, a record 
number, and the indices that specify the lattice point of interest. The CDF 
creation routines allow the programmer to specify which variables remain 
invariant with respect to each grid dimension or the record number. The 
software uses this information to avoid redundant storage of repeated data 
values. 

THE COMMON DATA 
FORMAT (CDF) 


• MAKES FEASIBLE THE DISTRIBUTION OF DATA-SET-INDEPENDENT 
SOFTWARE TOOLS FOR ANALYSIS AND DISPLAY 


• PROVIDES A COMMON STRUCTURE FOR ACQUISITION AND 
DISTRIBUTION OF DATA 

HIGHLIGHTS OF THE CDF 
SOFTWARE PACKAGE 


• SOFTWARE FOUNDATION FOR DATA MANAGEMENT AND ANALYSIS 

• DATA SET INDEPENDENCE 

• READILY APPLICABLE TO INTERDISCIPLINARY STUDIES 

• SINGLE MECHANISM FOR ORGANIZING DATA INTO MULTIDIMENSIONAL 
STRUCTURES CONSISTENT WITH SCIENTIFIC INTERPRETATION 

• SIMPLE ABSTRACT CONCEPTUAL ENVIRONMENT FOR SCIENTIFIC 
APPLICATIONS PROGRAMMER 

USING THE CDF 
SOFTWARE PACKAGE 


YOU DON'T HAVE TO: 

• WRITE MESSY FORTRAN FORMAT STATEMENTS 

• DO MESSY RECORD NUMBER CALCULATIONS TO FIND LOCATION OF AN 
ARRAY VALUE 


ALL YOU HAVE TO DO IS: 



AND APPLY THEM TO CALLING THE 
CDF INTERFACE ROUTINES 

FUNDAMENTAL CDF 
CONCEPTS 


• DATA ELEMENT 

• DATA RECORD 

• CDF BASIC GRID 

• CDF RANK 


VARIABLE 



SAMPLE DATA ELEMENT 
DESCRIPTION 


ELEMENT MNEMONIC:    TEMP 
ELEMENT NAME:        TEMPERATURE 
UNITS:               DEG. K 
DATA TYPE:           REAL 
RESOLUTION:          0.6 
DISPLAY FORMAT:      F7.3 
VALID RANGE:         170. TO 290 

DATA RECORDS IN SAMPLE 
DATA ENSEMBLE 


ELEMENT:          TIME    LONG     LAT     TEMP 

DATA RECORD #1    0100    150 W    30 N    200 
            #2    0100    150 W    40 N    195 
            #3    0100    165 W    30 N    196 
            #4    0100    165 W    40 N    190 
            #5    0130    150 W    30 N    203 
            #6    0130    150 W    40 N    194 
            #7    0130    165 W    30 N    195 
            #8    0130    165 W    40 N    197 

ATTRIBUTES OF SAMPLE 
DATA ENSEMBLE 


TIME          LONG              LAT             TEMP 

BLOCKS OF     CYCLIC            CYCLIC          VARIES 
4             2                 2               WITH 
IDENTICAL     FIXED             FIXED           TIME, 
VALUES        VALUES            VALUES          LAT, 
              (165 W, 150 W)    (30 N, 40 N)    LONG 


THERE IS A 2-BY-2 GRID OF TEMPERATURE VALUES FOR 
EACH OBSERVATION TIME. 

LONG "GENERATES" ONE GRID DIMENSION. 

LAT "GENERATES" SECOND GRID DIMENSION. 
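
As a minimal sketch of how the grid dimensions follow from the cyclic elements, the Python below counts the distinct values of LONG and LAT in the eight sample data records. The tuple layout and the name `basic_grid_shape` are assumptions made for illustration; they are not part of the CDF software package.

```python
# Sample data records as (TIME, LONG, LAT, TEMP), copied from the sample ensemble.
RECORDS = [
    ("0100", "150 W", "30 N", 200),
    ("0100", "150 W", "40 N", 195),
    ("0100", "165 W", "30 N", 196),
    ("0100", "165 W", "40 N", 190),
    ("0130", "150 W", "30 N", 203),
    ("0130", "150 W", "40 N", 194),
    ("0130", "165 W", "30 N", 195),
    ("0130", "165 W", "40 N", 197),
]


def basic_grid_shape(records, cyclic_columns):
    """Return one grid dimension size per cyclic column (its count of distinct values)."""
    return tuple(len({rec[col] for rec in records}) for col in cyclic_columns)


shape = basic_grid_shape(RECORDS, cyclic_columns=(1, 2))  # LONG, then LAT
grid_points = shape[0] * shape[1]
cdf_records = len(RECORDS) // grid_points  # one CDF record per observation time

print(shape, cdf_records)  # (2, 2) 2
```

Each cyclic element contributes one dimension, so the eight flat data records collapse to two records of a 2-by-2 grid.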

SAMPLE DATA ENSEMBLE 
RESTRUCTURED 


RECORD #1    TIME: 0100 

             LONG:    165 W    150 W 
    LAT: 40 N         190      195 
    LAT: 30 N         196      200 

RECORD #2    TIME: 0130 

             LONG:    165 W    150 W 
    LAT: 40 N         197      194 
    LAT: 30 N         195      203 

LONG: INVARIANT WITH RECORD NUMBER 

LAT: INVARIANT WITH RECORD NUMBER 
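
The restructuring shown on this slide can be sketched in Python as a regrouping of the flat data records into one 2-by-2 TEMP grid per observation time. The function name `restructure`, the nested-list layout, and the index order (LONG first, LAT second) are illustrative assumptions, not the package's internal representation.

```python
# Sample data records as (TIME, LONG, LAT, TEMP), copied from the sample ensemble.
RECORDS = [
    ("0100", "150 W", "30 N", 200),
    ("0100", "150 W", "40 N", 195),
    ("0100", "165 W", "30 N", 196),
    ("0100", "165 W", "40 N", 190),
    ("0130", "150 W", "30 N", 203),
    ("0130", "150 W", "40 N", 194),
    ("0130", "165 W", "30 N", 195),
    ("0130", "165 W", "40 N", 197),
]
LONGS = ["165 W", "150 W"]  # values along the first grid dimension
LATS = ["30 N", "40 N"]     # values along the second grid dimension


def restructure(records):
    """Group flat records by TIME; return the times and one TEMP grid per time."""
    times, grids = [], []
    for time, lon, lat, temp in records:
        if time not in times:
            times.append(time)
            grids.append([[None] * len(LATS) for _ in LONGS])
        grids[times.index(time)][LONGS.index(lon)][LATS.index(lat)] = temp
    return times, grids


times, temp = restructure(RECORDS)
print(times)    # ['0100', '0130']
print(temp[0])  # [[196, 190], [200, 195]]
print(temp[1])  # [[195, 197], [203, 194]]
```

Here `temp[r][i][j]` is the temperature for record `r`, longitude `LONGS[i]`, and latitude `LATS[j]`; LONG and LAT are held once because they do not change from record to record.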

SAMPLE DATA ENSEMBLE 
CDF PROGRAMMER’S VIEW 



A 2-BY-2 BLOCK. 


             TIME          LONG            LAT           TEMP 

RECORD #1    0100  0100    165 W  150 W    40 N  40 N    190  195 
             0100  0100    165 W  150 W    30 N  30 N    196  200 

RECORD #2    0130  0130    165 W  150 W    40 N  40 N    197  194 
             0130  0130    165 W  150 W    30 N  30 N    195  203 

VALUES OF ELEMENTS THAT ARE INVARIANT WITH RECORD NUMBER 
APPEAR TO BE DUPLICATED FOR SUCCESSIVE RECORDS. 


VALUES OF ELEMENTS THAT ARE INVARIANT WITH RESPECT TO 
A BASIC GRID DIMENSION APPEAR TO BE DUPLICATED ACROSS 
THAT DIMENSION. 

SAMPLE DATA ENSEMBLE 
CDF SPECIFICATIONS 


CDF RANK = DIMENSIONALITY OF BASIC GRID = 2 


VARIABLES 


ELEMENT:                   TIME     LONG     LAT      TEMP 

1ST DIMENSION VARIANCE:    FALSE    TRUE     FALSE    TRUE 

2ND DIMENSION VARIANCE:    FALSE    FALSE    TRUE     TRUE 

RECORD VARIANCE:           TRUE     FALSE    FALSE    TRUE 
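
To make the effect of these specifications concrete, the following Python sketch counts how many values would need to be held for each variable if every invariant dimension is stored only once. The counting rule is an assumption drawn from the variance flags above; the physical file layout of a CDF is not described here.

```python
from math import prod  # available in Python 3.8 and later

GRID_SHAPE = (2, 2)  # CDF rank 2: LONG by LAT
N_RECORDS = 2        # observation times 0100 and 0130

# variable -> ((1st dimension variance, 2nd dimension variance), record variance)
VARIANCES = {
    "TIME": ((False, False), True),
    "LONG": ((True, False), False),
    "LAT": ((False, True), False),
    "TEMP": ((True, True), True),
}


def stored_values(dim_variance, record_variance):
    """Count values kept when invariant dimensions and records are stored once."""
    per_record = prod(n for n, varies in zip(GRID_SHAPE, dim_variance) if varies)
    return per_record * (N_RECORDS if record_variance else 1)


counts = {name: stored_values(*flags) for name, flags in VARIANCES.items()}
apparent = len(VARIANCES) * N_RECORDS * prod(GRID_SHAPE)

print(counts)                # {'TIME': 2, 'LONG': 2, 'LAT': 2, 'TEMP': 8}
print(sum(counts.values()))  # 14
print(apparent)              # 32
```

Under this assumption the programmer sees 32 values (4 variables, 2 records, 4 lattice points each), while only 14 distinct values need to be kept.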

CDF DATA INSERTION, 
ACCESS, AND UPDATE 


TWO FORTRAN-77 ROUTINES: 

CDF_PUT (CDF_ID, VARIABLE_NUMBER, RECORD_NUMBER, 
         BASIC_GRID_INDICES, DATA_VALUE, RETURN_STATUS_CODE) 


CDF_GET (CDF_ID, VARIABLE_NUMBER, RECORD_NUMBER, 
         BASIC_GRID_INDICES, DATA_VALUE, RETURN_STATUS_CODE) 


ONCE DATA ELEMENT DICTIONARY IS DEFINED AND DATA ENSEMBLE STRUCTURE 
SPECIFIED, PROGRAMMER DOES NOT NEED TO KEEP TRACK OF DIMENSION AND 
RECORD VARIANCES. 
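
The calling pattern can be illustrated with a small in-memory stand-in written in Python. This is only a sketch: the class `ToyCDF`, the status codes `CDF_OK` and `CDF_ERROR`, the 1-based indexing, and the dictionary storage are all assumptions made for illustration. The object itself plays the role of CDF_ID, and the methods mirror the argument order of CDF_PUT and CDF_GET; they are not the FORTRAN-77 routines.

```python
CDF_OK = 0      # hypothetical status codes, not taken from the CDF package
CDF_ERROR = -1


class ToyCDF:
    """In-memory stand-in for a CDF; record numbers and indices are 1-based."""

    def __init__(self, grid_shape, variances):
        self.grid_shape = grid_shape
        # variable number -> (variance per grid dimension, record variance)
        self.variances = variances
        self._values = {}

    def _key(self, variable, record, indices):
        """Map the caller's full indices onto the single stored value."""
        if variable not in self.variances or record < 1:
            return None
        if len(indices) != len(self.grid_shape):
            return None
        if any(not 1 <= i <= n for i, n in zip(indices, self.grid_shape)):
            return None
        dim_variance, record_variance = self.variances[variable]
        kept = tuple(i for i, varies in zip(indices, dim_variance) if varies)
        return (variable, record if record_variance else 1, kept)

    def put(self, variable, record, indices, value):
        key = self._key(variable, record, indices)
        if key is None:
            return CDF_ERROR
        self._values[key] = value
        return CDF_OK

    def get(self, variable, record, indices):
        key = self._key(variable, record, indices)
        if key is None or key not in self._values:
            return None, CDF_ERROR
        return self._values[key], CDF_OK


TIME, LONG, LAT, TEMP = 1, 2, 3, 4  # variable numbers chosen for this sketch
cdf = ToyCDF(
    grid_shape=(2, 2),
    variances={
        TIME: ((False, False), True),
        LONG: ((True, False), False),
        LAT: ((False, True), False),
        TEMP: ((True, True), True),
    },
)
cdf.put(LONG, 1, (1, 1), "165 W")
cdf.put(TEMP, 1, (1, 2), 190)  # 165 W, 40 N at TIME 0100 (index order assumed)

long_value, long_status = cdf.get(LONG, 2, (1, 2))  # different record and LAT index
temp_value, temp_status = cdf.get(TEMP, 2, (1, 2))  # record 2 has not been written
print(long_value, long_status)  # 165 W 0
print(temp_value, temp_status)  # None -1
```

The caller always passes a record number and a full set of grid indices; the variance flags given at creation decide which of them actually select a stored value, so LONG written once is readable from any record and any latitude index.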

CDF DATA ABSTRACTION 
SUMMARY 


CDF DATA ENSEMBLE APPEARS TO BE BUILT FROM MULTIPLE OCCURRENCES 
OF A SINGLE N-DIMENSIONAL BLOCK THAT IS CONSISTENT WITH THE 
SCIENTIFIC INTERPRETATION OF THE DATA. 


VALUES FOR DIFFERENT DATA ELEMENTS ARE CORRELATED SIMPLY BY USE 
OF IDENTICAL RECORD NUMBERS AND BASIC GRID INDICES. 


REDUNDANT PHYSICAL STORAGE OF DATA FOR CYCLIC VARIABLES IS 
ELIMINATED BY SPECIFICATION OF RECORD AND GRID DIMENSION 
VARIANCES. 



